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Abstract. As two main focuses of the study of complex networks, the community 
structure and the dynamics on networks have both attracted much attention in 
various scientific fields. However, it is still an open question how the community 
structure is associated with the dynamics on complex networks. In this paper, 
through investigating the diffusion process taking place on networks, we demonstrate 
that the intrinsic community structure of networks can be revealed by the stable 
local equilibrium states of the diffusion process. Furthermore, we show that such 
community structure can be directly identified through the optimization of the 
conductance of network, which measures how easily the diffusion occurs among 
different communities. Tests on benchmark networks indicate that the conductance 
optimization method significantly outperforms the modularity optimization methods at 
identifying the community structure of networks. Applications on real world networks 
also demonstrate the effectiveness of the conductance optimization method. This work 
provides insights into the multiple topological scales of complex networks, and the 
obtained community structure can naturally reflect the diffusion capability of the 
underlying network. 
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As two main focuses of the study of complex networks, the community structure 
and the dynamics on networks have both attracted much attention in various scientific 
fields. On one hand, many methods for community detection have been proposed and 
applied successfully to some specific complex networks [Il[2i[3j|IJ|5j|6j[7J|8j|9]. Each 
method requires, explicitly or implicitly, a definition of a community from different 
perspectives, such as the centrality measures, link density, percolation theory, and 
network compression. Generally, community structure is known as the existence of 
groups of nodes such that nodes within a group are much more connected to each other 
than to the rest of the network [10] . Community structure may provide insight into the 
relation between the structure and the function of complex networks. Taking the World 
Wide Web as an example, closely hyperlinked web pages form a community and they 
often concern related topics [H] [12] • 

A well known definition for a community is via the modularity, which is proposed 
by Newman et al as a quality function for a partition of the network. Modularity is 
effective for detecting the community structure of many real world networks. However, 
as pointed out by Fortunato et al [13] . modularity suffers the resolution limit problem 
and this problem raises concerns about the reliability of the communities detected 
through the optimization of modularity. In [13] , the authors claimed that the resolution 
limit problem is attributable to the coexistence of multiple scale descriptions of the 
topological structure of the network, while only one scale is obtained through directly 
optimizing the modularity. In addition, the definition of modularity only considers the 
significance of link density from the static topological structure of network, and it is 
unclear how the modularity based community structure is correlated to the dynamics 
on network. 

On the other hand, many efforts are devoted to understanding the properties of 
the dynamical processes taking place on the underlying networks [T5l [T6] . In recent 
years, researchers have begun to investigate the correlation between the community 
structure and the dynamics on networks. For example, Arenas et al pointed out that the 
synchronization reveals the topological scale in complex networks [17] . In addition, the 
random walk on a network was also extensively studied and used to uncover community 
structure of the network [T8l IT9l [20] . In [21] [22], the random walk on a network is 
introduced for defining the distance between network nodes, and an algorithm based 
on this distance is proposed for partitioning the network into communities. In [23] , the 
authors proposed quantifying and ranking the quality of network partitions in terms 
of their stability, defined as the clustered autocovariance in the random walk process 
taking place on the network. 

In this paper, through investigating the diffusion process taking place on the 
network, we note that local equilibrium states appear before the diffusion process reaches 
the final equilibrium state. The stability of local equilibrium states can be measured by 
their duration time in the diffusion process. Then, we demonstrate that the intrinsic 
community structure is revealed by the stable local equilibrium states of the diffusion 
process. Furthermore, we show that such community structure can be directly identified 
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through the optimization of the conductance of the network, which measures how easily 
the diffusion among different communities occurs. 

We first introduce some notations used later. An undirected network G = (V, E) 
with iV nodes is often described in terms of its adjacency matrix A whose elements A xy 
denote the strength of the link connecting the nodes x and y. The strength of node x 
is denoted by s x = ^2 y A xy . For a node set V\ C V, \V±\ denotes the number of node in 
Vi, the volume of V x is defined as vol(V x ) = J2 x ev 1 and in.vol(Vi) = ^2 xeVuyeVl A *y 
is referred to as the inward volume of V±. 

We start with investigating the diffusion process which describes the dynamics of 
a random walker moving on the network. At each time t, the random walker moves 
from its current node x to one of its neighboring nodes y randomly with the probability 
p(x —> y) = A xy /s x . The dynamics of this process can be described as 



where p x (t) is the probability that the random walker resides at the node x at the time 
t, and r is a parameter controlling the rate of the diffusion process. The matrix L is the 
normalized Laplacian matrix defined as L = I — D~ 1 A, where I is the identity matrix 
and D is a diagonal matrix with its diagonal elements D xx = s x . 

For any starting node, as time proceeds, the diffusion process described in the 
equation (TTJ will definitely move towards equilibrium if the underlying undirected 
network is connected and non-bipartite [24]. When the diffusion process is at the 
equilibrium state, it satisfies the so-called detailed balance condition [25], i.e., the 
probability that the random walker walks through the nodes x and y successively is 
equal to the probability that the random walker walks through the nodes y and x 
successively. Formally, the detailed balance condition can be denoted by p x (t)p(x — > 
y) = p y (t)p(y — > x) and the reduced form is p x (t)/s x = p y (t)/s y for undirected networks. 

We explore the transients in the whole diffusion process instead of only the final 
equilibrium state. During the diffusion process on a network, it is known that the 
detailed balance condition is satisfied among highly interconnected nodes first and then, 
sequentially, among less interconnected ones, until among all the nodes. In order to 
evaluate how closely two nodes x and y satisfy the detailed balance condition at the 
time t, we introduce a measure c xy (t) as 



where (• ■ ■) averages over different realizations of the diffusion process with randomly 
selected starting nodes. In practice, a pair of nodes x and y is said to satisfy the 
detailed balance condition at the time t when c xy (t) is smaller than a given threshold. 
A set V of nodes is said to satisfy the detailed balance condition if the average value 
Sj/ev C xy(t)/\V\ of c xy (t) for each node x is smaller than the given threshold. Relative 
to the final equilibrium state, we say that the diffusion process is at a local equilibrium 
state when only several groups of nodes locally satisfy the detailed balance condition. 
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Figure 1. (color online). The matrix c xy (t) of two transients in the diffusion 
process taking place on the H13-4 network. Each value of c xy {t) is the average over 
10, 000 realizations of the diffusion process with randomly selected starting nodes. The 
parameter r — 0.01. 
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Figure 2. (color online). The diffusion dynamics on the randomized H13-4 network, 
(a) The matrix c xy (t) of a transient in the diffusion process. Each value of c xy {t) is 
the average over 10,000 realizations of the diffusion process with randomly selected 
starting nodes. The parameter r — 0.01. (b) The number of node groups satisfying 
the detailed balance condition as a function of time t. Here, the threshold for c xy {t) is 
set to be 1.0 x 10~ 4 . 



Using the matrix c xy (t), we can trace the different local equilibrium states during the 
diffusion process. 

As an example, we use c xy (t) to analyze the diffusion process on the H13-4 network, 
which is constructed according to [17]. The network has two predefined hierarchical 
levels, the first hierarchical level consists of 4 groups of 64 nodes and the second 
hierarchical level consists of 16 groups of 16 nodes. Figure Q] illustrates the c xy (t) of 
two transients corresponding to two different local equilibrium states of the diffusion 
process. The squares along the diagonal suggest that the corresponding groups of 
nodes satisfy the detailed balance condition. These node groups reveal the predefined 
hierarchical levels in the H13-4 network. For comparison, we further investigate the 
diffusion dynamics on the randomized H13-4 network, which is constructed through 
shuffling the edges of the H13-4 network used in figured] From figure [2(a), we can see 
that there is no node group locally satisfying the detailed balance. 

A phenomenon similar to that illustrated in figure [1] has also been observed in the 
synchronization process. In [T7J, the authors claimed that this phenomenon reveals the 
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topological scale of networks. The authors also pointed out that the phenomenon of 
the synchronization process is correlated with the spectrum of the Laplacian matrix 
associated with the underlying network. According to the characteristics of the 
Laplacian matrix, as pointed out in [26], the community structure revealed by the 
synchronization process is heavily affected by the heterogeneous degree distribution and 
the community size distribution. In the following, we will show that the local equilibrium 
state phenomenon is correlated with the spectrum of the normalized Laplacian matrix, 
which takes the heterogeneous degree and community size distribution into account 
and thus behaves better than the Laplacian matrix as regards clustering the nodes of 
network [26J. 

In this paper, a local equilibrium state is regarded as stable if the set of node groups 
satisfying the detailed balance condition remains unchanged for a long duration in the 
diffusion process. To investigate the stability of the local equilibrium states, we study 
the solution of equation ([I]) in terms of the normal modes <fi(t), which reads 

p*(t) =^Uriipi(t) = Y^U xi tpi(ti)e- x * rt , x = l,...,N, (3) 

i i 

where the Aj are the eigenvalues of the transpose of the normalized Laplacian matrix L, 
and U is the eigenvector matrix whose ith column is the eigenvector corresponding to 
the eigenvalue Aj. Given the starting node of the diffusion process, the initial amplitudes 
(fii(0) can be determined according to equation fl3]) due to the eigenvector matrix U 
being fixed, i.e., <fi(0) only depends on the starting node of the diffusion process. Note 
that, as pointed out following the equation (j2J), we investigate the average behavior of 
many different diffusion processes with randomly selected starting nodes. Thus, the 
choice of the starting nodes does not affect the results of the following analysis in this 
paper. Without loss of generality, we rank these eigenvalues in the ascending order 
= Ai <A 2 <■•• <Ai <... < X N . 

As time proceeds in the diffusion process, these normal modes fi{t) = 
(fi(0)e~ XiH (i 7^ 1) will decay to zero. We use Tj to denote the time when the normal 
mode fi(t) decays to zero. Formally, r, is infinite. In practice, a threshold e is usually 
used to determine when (pi(t) decays to zero, i.e., (pi(t) < e. In this case, we have 

T . = ]_ x ln<Pi(0)-lne ^ 
Ai r 

All these moments Tj (1 < i < N) form a series of time intervals, respectively 
[t n+1 = 0,t n ), [rjv,rjv-i), • •, bi+i,Ti), [r 3 ,r 2 ), [r 2 ,Ti = oo). These time intervals 
divide the whole diffusion process into iV stages. Specifically, the time interval [Tj + i, Tj) 
is regarded as the ith stage. When the diffusion process is at the ith stage, only the 
normal modes (pj(t)(l < j <i) have not decayed to zero. Thus we have 

i 

p x (t) « x = l,...,N. (5) 

3=1 

This indicates that the value p x (t) of node x at the ith stage can be represented by 
the i-dimension coefficient vector of (Pj(t), i.e., (U x \, U X 2, ■ • • , U X j, ■ ■ • , U X j). According 
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Figure 3. (color online). Left: the number of node groups satisfying the detailed 
balance condition as a function of time t. Here, the threshold for c xy (t) is set to be 
1.0 x 10~ 4 . Right: the number of non-decayed normal modes in terms of the time t. 

to equations (J2J) and (jSJ), given a threshold, we can identify the node groups satisfying 
the detailed balance condition through clustering the normalized i-dimension vectors of 
(U x i, U x2 , • ■ ■ , U x j, ■ ■ ■ , U xi ) using, for example, the fc-means clustering method. The set 
of such node groups is unchanged due to the non-decayed normal modes being fixed 
during the same stage. This means that the local equilibrium state is stable if the 
corresponding stage persists for a long time. 

Taking the H13-4 network as an example, the left panel of figure |3] shows the 
different local equilibrium states of the diffusion process and the right panel illustrates 
the different stages in terms of the number of non-decayed normal modes of the diffusion 
process. Through comparing the two panels, we see that each time one normal mode 
decays to zero, the diffusion process changes from a local equilibrium state to a new one. 
It is observed that two stable local equilibrium states with long durations emerge in the 
diffusion process. The node groups corresponding to these two states clearly reveal the 
intrinsic community structure of the H13-4 network. In addition, from figure E^b), we 
can see that no stable local equilibrium state appears in the diffusion process taking 
place on the randomized H13-4 network. This is reasonable, as it is commonly believed 
that randomized network have no community structure. All these findings suggest that 
the appearance of stable local equilibrium states in a diffusion process indicates the 
existence of community structure in the underlying network. 

Note that the diffusion occurs much more frequently within the node groups than 
among them when the diffusion process is at a stable local equilibrium state. This 
indicates that there exists a high transitive cohesion inside such node groups. The 
community structure comprised of these node groups could well reflect the diffusion 
dynamics on the underlying network. Regarding such community structure as a 
partition of a network, we measure the quality of the partition through introducing the 
conductance of networks, which reflects how easily the diffusion occurs among different 
communities. 

For a given partition V = {Vi,...,Vk}, the conductance is defined as the 
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average departure probability, PdeptiVi), of all the communities Vi, that is C(V) = 
iYli=iPdept{Vi). The departure probability of a community Vi is the probability that 
the random walker departs from V. in the next time step given that it resides at V, 
when the diffusion process is at the final equilibrium state. Formally, the departure 
probability of Vi can be computed by using 

nn _ Exev^yeVi Px(oo)p(x y) _ m _vol(Vi) 

PdeptyVi — ^ 7 — \ — 1 T7T7\ ' W 

E^y.M 00 ) voliVi) 

where p x {oo) = s x /vol(V) is the stationary distribution which characterizes the final 
equilibrium state of the diffusion process. In this way, the conductance is formally 
denoted by 

Ci^\ty^). (7) 

Actually, it can be proved that the community structure associated with the stable 
local equilibrium state can be exactly identified through minimizing the conductance 
directly. Without loss of generality, we assume that the stable local equilibrium state 
emerges at the kth stage in the diffusion process. As mentioned above, the community 
structure associated with this state can be identified through clustering the normalized 
fc-dimension vectors of (U x x, U x i, ■ • • , U X i, ■ ■ • , U X k). Further, through the matrix trace 
maximization method [27] , it can be proved that the optimization of the conductance for 
a fixed k can be done through clustering the top k eigenvectors of the transpose of the 
normalized Laplacian matrix, corresponding to the 1st to /cth columns of U. Therefore, 
the optimization of conductance provides an effective way to identify the community 
structure associated with the stable local equilibrium state. 

Now we clarify the difference between the conductance used in this paper and 
the earlier measure for the quality of network partition from the perspective of a 
random walk on networks. Firstly, as pointed out in [28], as a measure of the quality 
of network partition, the modularity can be described as the difference between the 
probability that a random walker resides in the same community on two successive 
time steps and the probability that two independent random walkers both resides in 
the same community. Secondly, in [23], through considering the random path with 
length t instead of the length 1 for the modularity and the length of infinity for the 
spectral partition, the authors proposed that the stability of network partitions be 
defined as the clustered autocovariance of the random walk. This stability provides 
a general framework for quantifying and ranking the quality of network partitions 
from the perspective of the whole network, i.e., it characterizes the fraction of the 
within-community random paths with length t with respect to all the random paths of 
length t. However, the conductance considers the quality of network partition from the 
perspective of each community instead of the whole network, i.e., it reflects the fraction 
of within-community random paths with respect to all the random paths departing 
solely from the community considered. This provides the advantage for handling the 
heterogeneous distribution of community size (or volume) which is common to real world 
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Figure 4. (color online), (a-b) The most stable local equilibrium and the different 
stages of the diffusion process on the benchmark network with the mixing ratio 
/^=0.3 and the number of communities is 30. (c) Comparison between the conductance 
optimization method and the modularity optimization method on benchmark networks. 
Each point corresponds to an average over 100 network realizations. 



networks. As follows, the application of the conductance optimization method to the 
benchmarks of Lancichinetti et al also demonstrates that our method can effectively 
handle the heterogeneous distribution of community size. 

To test the effectiveness of our method for community detection based on the 
optimization of conductance, we utilize the benchmark proposed by Lancichinetti et 
al in [29]. This benchmark provides networks with heterogeneous distributions of node 
degree and community size. Thus it poses a much more severe test of community 
detection algorithms than standard benchmarks. Many parameters are used to control 
the generated networks in this benchmark: the number of nodes N, the average node 
degree (k), the maximum node degree max_/c, the mixing ratio fi, the exponent of the 
power law node degree distribution ti, the exponent of the power law distribution of 
community size t2, the minimum community size min_c, and the maximum community 
size max_c. In our tests, we use the default parameter configuration where iV = 1000, 
(k) = 15, max_fc = 50, ti — 2, tz — 1, min_c = 20, and max_c = 50. By tuning the 
parameter /i, we test the effectiveness of our method on the networks with different 
fuzziness of communities. The larger the parameter /i, the fuzzier the community 
structure of the generated network. In addition, we adopt the normalized mutual 
information (NMI) |30j in order to compare the partition found by the algorithms with 
the answer partition. The larger the NMI, the more effective the tested algorithm. 

Figure IU(a-b) illustrate the most stable local equilibrium state and the different 
stages of the diffusion process on the benchmark network with the mixing ratio yU=0.3 
and the number of communities equal to 30. The squares along the diagonal indicate 
the predefined communities in the network. The number of these communities is clearly 
revealed by the most stable local equilibrium state. Figure HJ^c) shows the comparison 
between the conductance optimization method and the modularity optimization method 
in terms of the NMI on the benchmark network. When the community structure 
is evident, both our method and the modularity optimization method (e.g., the 
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Figure 5. (color online). Illustration of the conductance optimization method using 
the real world network, (a) The friendship network of the karate club studied by 
Zachary pQ. Colors are used to differentiate the communities uncovered by the 
conductance optimization method when considering the stage (I) in figure EJb). 
Shapes, circle and square, are used to distinguish the communities corresponding to 
the stage (II) and the real social split of this network observed by Zachary. (b) The 
different stages of the diffusion process taking place on the network in figure [5fa). Two 
most stable equilibrium states are marked with (I) and (II) . 



fast unfolding algorithm [7] and the spectral method [3]) can accurately identify the 
community structure. However, when the community structure becomes fuzzier, the 
performance of the modularity optimization method deteriorates while our method still 
achieves rather good results. 

In addition, we also tested the conductance optimization method on many real 
world networks which are widely used to evaluate community detection methods. These 
networks include the social network of Zachary's karate club, the social network of 
dolphins of Lusseau et al, the college football network of the United States [1 Oj , 
the journal index network constructed in [5], and the network of political books [3]. 
For all these networks, the conductance optimization method obtains extremely good 
results. Taking Zachary's network as an example, figure [5](b) illustrates the different 
stages of the diffusion process taking place on the network. The two most stable 
local equilibrium states are marked (I) and (II), and the corresponding communities 
are depicted in figure [5|a). Besides the stages (I) and (II), another relatively stable 
state can be also observed during the diffusion process, as shown in figure Mb). The 
corresponding three communities are respectively the one comprised of all the circle 
nodes and two communities formed by the square nodes but with different colors, as 
shown in figure G^a). Actually, as regards the three stable states, it is really hard to say 
which the best one is. In this paper, the duration time of each state may provide an 
effective candidate measure for the significance of network divisions. 

In summary, we find that several stable local equilibrium states emerge during the 
diffusion process on networks with community structure. These stable states reveal 
the intrinsic community structure of the underlying networks. We further propose a 
conductance optimization method to identify the community structure, which naturally 
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reflects the diffusion capability of the network. This work provides new insights into 
the number of communities and the multiple topological scales of complex network. As 
future work, we will study the relation and difference between the conductance and the 
modularity method. 
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